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Abstract 

The practicality of the stochastic network calculus (SNC) is often questioned on grounds of potential looseness of its 
performance bounds. In this paper it is uncovered that for bursty arrival processes (specifically Markov-Modulated On- Off 
(MMOO)), whose amenability to per-flow analysis is typically proclaimed as a highlight of SNC, the bounds can unfortunately 
indeed be very loose (e.g., by several orders of magnitude off). In response to this uncovered weakness of SNC, the (Standard) per- 
flow bounds are herein improved by deriving a general sample-path bound, using martingale based techniques, which accommodates 
FIFO, SP, EDF, and GPS scheduling. The obtained (Martingale) bounds gain an exponential decay factor of O (e _an ) in the 
number of flows n. Moreover, numerical comparisons against simulations show that the Martingale bounds are remarkably accurate 
for FIFO, SP, and EDF scheduling; for GPS scheduling, although the Martingale bounds substantially improve the Standard bounds, 
they are numerically loose, demanding for improvements in the core SNC analysis of GPS. 

I. Introduction 



Several approaches to the classical queueing theory have emerged over the past decades. For instance, matrix analytic methods 
t^. ' (MAM) not only provide a unified treatment for a large class of queueing systems, but they also lend themselves to practical 
^ — 1 . numerical solutions; two key ideas are the proper accounting of the repetitive structure of underlying Markov processes, and 
' the use of linear algebra rather than classic methods based on real analysis (see Neuts ll35l and Lipsky PD ). Another unified 
["t | approach targeting broad classes of queueing problems is the stochastic network calculus (SNC) (see Chang |7| and Jiang and 
' Liu |23l ), which can be regarded as a mixture between the deterministic network calculus conceived by Cruz iTPfl (see Le 
^ , Boudec and Thiran ]3)) and the effective bandwidth theory (see Kelly |25l ). Because SNC solves queueing problems in terms 
O ' of bounds, it is often regarded as an unconventional approach, especially by the queueing theory community. 

MAM and SNC could be (slightly) compared by the way they apply to queues with fluid arrivals. In their simplest form, 
' fluid arrival models were defined as Markov-Modulated On-Off (MMOO) processes by Anick, Mitra, and Sondhi Q], and were 
J> , significantly extended thereafter, especially for the purpose of modelling the increasingly prevalent voice and video traffic in 
""xl" ' the Internet. By relating fluid models and Quasi-Birth-Death (QBD) processes, Ramaswami has argued that MAM can lend 
themselves to numerically more accurate solutions than spectral analysis methods ||39l . In turn, SNC can produce alternative 
' solutions with negligible numerical complexity, but these are arguably less relevant than exact solutions (simply because they 
• , are expressed as bounds). What does, therefore, justify more than two decades of research in SNC? 

The answer lies in two key features of SNC: scheduling abstraction and convolution-form networks (see Ciucu and Schmitt Ifl2l ). 
, The former expresses the ability of SNC to compute per-flow (or per-class) queueing metrics for a large class of scheduling 
7—1 ' algorithms, and in a unified manner. Concretely, given a flow A sharing a queueing system with other flows, the characteristics 
, of the scheduling algorithm are first abstracted away in the so-called service process; thereafter, the derivation of queueing 
metrics for the flow A is scheduling independent. Furthermore, the per-flow results can be extended in a straightforward manner 
from a single queue to a large class of queueing networks (typically feed-forward), which are amenable to a convolution- form 
representation in an appropriate algebra. 

By relying on these two features, SNC could tackle several open queueing networks problems. The typical scenario involves 
the computation of end-to-end (e2e) non-asymptotic performance bounds (e.g., on the delay distribution) of a single flow 
crossing a tandem network and sharing the single queues with some other flows. Such scenarios were solved for a large 
class of arrival processes (see, e.g., Ciucu et al. IfTTI . Q and Fidler |20l for MMOO processes, and Liebeherr et al. |29l 
for heavy-tailed and self-similar processes). Another important solution was given for the e2e delay distribution in a tandem 
(packet) network with Poisson arrival and exponential packet sizes, by circumventing Kleinrock's independence assumption 
on the regeneration of packet sizes at each node (see Burchard et al. (4)). 

Based on its ability to partially solve fundamentally hard queueing problems (i.e., in terms of bounds), SNC is justifiably 
proclaimed as a valuable alternative to the classical queueing theory (see Ciucu and Schmitt [12|). At the same time, SNC 
is also justifiably questioned on the tightness of its bounds. While the asymptotic tightness generally holds (see Chang Q, 
p. 291, and Ciucu et al. 01|), doubts on the bounds' numerical tightness shed skepticism on the practical relevance of SNC. 
This skepticism is supported by the fact that SNC largely employs the same probability methods as the effective bandwidth 
theory, which was argued to produce largely inaccurate results for non-Poisson arrival processes (see Choudhury et al. ||9l). 
Moreover, although the importance of accompanying bounds by simulations has already been recognized in some early works 
(see Zhang et al. B31l for the analysis of GPS), the SNC literature is scarce in that respect. 



A 2-page extended abstract of this paper will appear at ACM Sigmetrics 2013. 
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In this paper we reveal what is perhaps 'feared' by SNC proponents and expected by others: the bounds are very loose 
for the class of MMOO processes, which is very relevant as these can be tuned for various degrees of burstiness. In addition 
to providing numerical evidence for this fact (the bounds can be off by arbitrary orders of magnitude), we also prove that 
the bounds are asymptotically loose in most multiplexing regimes. Concretely, we (analytically) prove that the bounds are 
'missing' an exponential decay factor of O (e~ an ) in the number of flows n, where a > 0; this missing factor has been 
indicated through numerical experiments in Choudhury et al. (9] in the context of effective bandwidth bounds (which scale 
identically as the SNC bounds). 

While this paper convincingly uncovers a major weakness in the SNC literature, it also shows that the looseness of the 
bounds is generally not inherent in SNC but it is due to the 'temptatious' but 'poisonous' elementary tools from probability 
theory leveraged in its application. In this sense, we prove that by leveraging more advanced tools (i.e., martingale based 
techniques), the SNC bounds improve dramatically to the point that they almost match simulation results. Concretely, we show 
these improvements to hold for per-flow delay bounds in FIFO, SP, and EDF scheduling scenarios with MMOO flows. We 
also show that, in the case of GPS scheduling, even the significantly improved bounds are loose. Overall, this paper provides 
convincing evidence that the core analysis in SNC, being reminiscent from the deterministic network calculus, is tight for 
FIFO, SP, and EDF, but it needs to be revisited for GPS. 

The sharp bounds obtained in this paper are the first in the conventional stochastic network calculus literature (i.e., involving 
service processes). Their significance, relative to existing sharp bounds in the effective bandwidth literature (e.g., Duffield lfl8l 
and Chang (7|, pp. 339-343, using martingale inequalities, or Liu et al. ll32l by extending an approach of Kingman involving 
integral inequalities j27)), is that they apply at the per-flow level for various scheduling; in turn, existing sharp bounds only 
apply at the aggregate level. In other words, our sharp bounds generalize existing ones by accounting for scheduling. 

A weakness of our results, from a purely network calculus perspective, is that they are restricted to a specific class of 
processes, i.e., MMOO; we point out that one of the conceptual promises of the SNC is to provide general bounds for much 
broader classes. While we thus deliberately sacrifice this conceptual generality, we also advocate a conceptual shift in running 
the SNC. Concretely, based on the results obtained in this paper, we believe that 1) SNC must be coupled with the mainstream 
queueing literature, in particular by "getting a firm grip on arrivals", and 2) the main two features of SNC (i.e., dealing with 
scheduling and multi-node) must be carefully leveraged in order to obtain sharp bounds. 

The rest of the paper is structured as follows. In Section [EI] we identify, at an intuitive level, the elementary tool from 
probability theory which is 'responsible' for the very loose (Standard) bounds in SNC. In Section [Til] we describe the queueing 
model and some necessary SNC formalisms. The core of the paper is Section [TV] which computes the improved (Martingale) 
and reviews the existing (Standard) SNC per-flow delay bounds in multiplexing scenarios with MMOO flows; both analytical 
and numerical comparisons of the bounds are further explored. Concluding remarks are presented in Section [V] 



This section overviews the SNC bounding approach to compute per-flow queueing metrics for broad classes of arrivals and 
scheduling. In addition to identifying three major steps in this approach, it is conveyed by means of a simple example that 
careless bounding can lend itself to impractical results. 

Towards this end, we consider a simplified queueing systems in which a (cumulative) arrival process A(t) shares with some 
other flows a server with capacity C and infinite queue length. We are particularly interested in the complementary distribution 
of A(t)'s backlog process B{t), which is bounded in SNC for some t,a > according to 



Here, A(s,t) := A{t) — A(s) is the bivariate extension of A(t), whereas S(s,t) is another bivariate process, called a service 
process, encoding the information about the server, the scheduling, and the other arrival processes that A(t) shares the server 
with. In the simplest setting with no other arrivals, S(s,t) = C(t — s) and Eq. (HJ (with equality) recovers Reich's equation. 
In another setting in which A(t) receives the lowest priority, should the server implement a static priority (SP) scheduler, then 
S(s,t) = Git — s) — A c (s,t), where A c (s,t) denotes the other arrivals at the server. 
Eq. ([T]i typically continues in SNC by invoking the Union Bound, i.e., 



The probability events can be further computed either by 1) convolving the distribution functions of A(s, t) and S(s, t), when 
available, and under appropriate independence assumptions, or by following a more elegant procedure using the Chernoff 
bound, i.e., 



II. Three Bounding Steps in SNC and One Pitfall 
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(2) 
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(3) 



s=0 



3 



for some 6 > 0. The expectation can be split into a product of expectations, according to the statistical independence properties 
of A(s,t) and S(s,t), and the sum can be further reduced to some canonical form. 

Eqs. (|TJ-<[3j outline three major bounding steps. The first is 'proprietary' to SNC, in the sense that it involves the unique 
construction of a 'proprietary' service process S(s,t) which decouples scheduling from analysis. The next two follow general 
purpose methods in probability theory, which are applied in the same form in the effective bandwidth theory, except that S(s, t) 
is now a random process rather than a constant-rate function. 

The second step in particular reveals a convenient mathematical continuation of Eq. ([TJ. The reason for this 'temptatious' 
step to be consistently invoked in SNC stems from the 'freedom' of seeking for bounds rather than exact results. As we will 
show over the rest of this section, and of the paper, this 'temptatious' step is also 'poisonous' in the sense that it lends itself 
to very loose bounds for a class of queueing scenarios which is being proclaimed in SNC as a highlight of its scope. 

To convey insight into this direction, let us consider the stationary but non-ergodic process 

A(s, t) = (t - s)X V0 < s < t , (4) 

where X is a Bernoulli random variable taking values in {0, 2}, each with probabilities 1 — e > .5 and e > 0. Assume also 
that S(s,t) = t — s. Clearly, for a > and for sufficiently large t, the backlog process satisfies 

F(B(t) ><t) = e . 

In turn, the application of the bound from Eq. (f2) lends itself to a bogus bound, i.e., 

¥(B(t) > a) <et , 

for a < 1 (for a > 1, the bound diverges as well). The underlying reason behind this bogus result is that the Union Bound 
from Eq. (O is agnostic to the statistical poperties of the increments of the arrival process A(s, t). 

The construction of A(s,t) from Eq. is meant to convey insight into the poor performance of the Union Bound for 
arrivals with correlated increments, such as MMOO processes. Within the same class, another relevant arrival process is the 
fractional Brownian motion which has long-range correlations; the analysis of such process was done either by approximations 
(e.g., Norros 11361 ) or by using the Union Bound (e.g., Rizk and Fidler [40 1). The rest of the paper will unequivocally reveal 
that the Union Bound leads to very loose per-flow bounds for MMOO processes. 

The Union Bound can however lend itself to reasonably tight bounds when X s := A(s,t)'s are rather uncorrected (see 
Talagrand l44l ). Shroff and Schwartz 11421 argued that the effective bandwidth theory yields reasonable bounds only for Poisson 
processes. Moreover, Ciucu iflOl provided numerical evidence that SNC lends itself to reasonably tight bounds for Poisson 
arrivals as well. 



III. Queueing Model 

This section introduces the queueing model and necessary SNC formalisms. The time model is continuous. Consider a 
stationary (bivariate) arrival process A(s, t) defined as 

A(s,t):=f a{u)du V0 < s < t , A(t) := A(Q, t) , 

J U — S 

where a(s) Vs > is the increment process. 

According to Kolmogorov's extension theorem, the one-side (stationary) process {a(s) : < s < oo} can be extended to 
a two-side process {a(s) : — oo < s < oo} with the same distribution. For convenience, we often work with the reversed 
cumulative arrival process A T (s,t) defined as 

A r (s,t)=f a(-u)du V0 < s < t , A r (t) := A'(0, t) . 

J U—S 

This definition is identical with that of A(s,i), except that the time direction is reversed. 

Working with time reversed processes is particularly convenient in that the steady-state queueing process (say in a queueing 
system with constant-rate capacity C fed by the one-side increment process <x(s)) can be represented by Reich's equation 



Q = sup{A r (i) - Ct} 



The evaluation of Q needs an additional stability condition, e.g., limsup^^ — ^ < C a.s. (see Chang JT), pp. 293-294); this 
condition is fulfilled by the (stronger) Loynes' condition, i.e., o(s) is also ergodic and lim t _ i . 00 = E [a(l)] < C a.s. 

In this paper we mostly consider the queueing system depicted in Figure [TJ Two cumulative arrival processes A\ (t) and 
A 2 (t), each containing n\ and n 2 sub-flows, are served by a server with constant-rate C = nc, where n = ri\ + n 2 - The 
parameter c is referred to as the per-(sub)flow capacity, and will be needed in the context of asymptotic analysis. For clarity, 
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Fig. 1. A queueing system with two arrival processes A\(t) and ^(i), each containing n± and ri2 sub-flows. The server has a capacity C = nc, where 
n = ri\ + «2- We are interested in the delay distribution of A\(€). 

A\ (i) and (i) will also be suggestively referred to as the through and cross (aggregate) flows, respectively. The data units 
are infinitesimally small and are referred to as bits. The queue has an infinite size capacity, and is assumed to be stable. 
The performance measure of interest is the virtual delay process for the (through) flow Ai(t), defined as 

Wi(t) :=inf{d>0: A x {t - d) < D x {t)} Vf > , 

where D\(t) is the corresponding departure process of A\(t) (see Figure [U. The attribute virtual expresses the fact that Wi(t) 
models the delay experienced by a virtual data unit departing at time t. Note that W\ (t) is the horizontal distance between 
the curves A\(t) and Di(t), starting backwards from the point (t,Di(t)) in the Euclidean space. 

In stochastic network calculus, queueing performance metrics (e.g., bounds on the distribution of the delay process W\{t)) 
are derived by constructing service curve processes, which relate the departure and arrival processes by a (min, +) convolution. 
For instance, in the case of A\(t) and D\(t), the corresponding service process is a stochastic process S\{s,t) such that 

Di(t) >A l *S 1 (t) V* >0 , (5) 

where "*" is the (min, +) convolution operator defined for all sample-paths as A\ * Si(t) := info< s <t {Ai(s) + Si(s,t)}, 

The service process Si(s, t) typically encodes the information about the cross aggregate A%{t) and the scheduling algorithm; 
other information such as the packet size distribution is omitted herein in accordance to the infinitesimal data units assumption. 
Conceptually, the service process representation from Eq. (0 encodes A\{t)\ own service view, as if it was alone at the 
network node (i.e., not competing for the service capacity C with other flows). Although the representation is not exact due 
to the inequality from Eq. (0, it suffices for the purpose of deriving upper bounds on the distribution of W\(t). The driving 
key property is that Eq. (|5]l holds for all arrival processes A\{t). Due to this property, the service representation in SNC is 
somewhat analogous with the impulse-response representation of signals in linear and time invariant (LTI) systems (see Ciucu 
and Schmitt iTPH for a recent discussion on this analogy). 

In this paper we will compute the distribution of the through aggregate's delay process Wi(t) for four distinct scheduling 
algorithms at the server, i.e., First-In-First-Out (FIFO), Static Priority (SP), Earliest-Deadline-First (EDF), and Generalized 
Processor Sharing (GPS). The enabling service process Si(s,t) for the delay computations, for each of scheduling algorithms, 
will be presented in Section HV-AI 

IV. SNC Bounds for MMOO Processes 

In this section we consider the queueing scenario from Figure [T] in which the sub-flows comprising A\ (t) and A2 (t) are 
Markov-Modulated On-Off (MMOO) processes. Because such processes can be tuned for various degrees of burstiness, they 
are particularly relevant for both modelling purposes and testing the tightness of related performance bounds. 

After defining the MMOO processes, we derive Martingale bounds for the distribution of Wi(t) for FIFO, SP, EDF, and 
GPS scheduling. Then we overview the corresponding Standard bounds in SNC. Lastly, we compare these bounds both 
asymptotically, as well as against simulations. 




P 



Fig. 2. A Markov-modulated On-Off (MMOO) process 

Each MMOO sub-flow is modulated by a continuous time Markov process Z(t) with two states denoted by and 1, and 
transition rates \i and A as depicted in Figure [2] The cumulative arrival process for each sub-flow is defined as 

A'(s, t) := I Z{u)Pdu VO < s < t , A'(t) := A'(0, t) , (6) 

J 1L — S 

where P > is the peak rate. In other words, A'(t) models a data source transmitting with rates and P while Z(t) delves 
in the and 1 states, respectively. The steady-state 'On' probability is p := jj— and the average rate is pP. 
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Fig. 3. A Markov-modulated process for the aggregation of n homogeneous MMOOs 

When n such statistically independent sources are multiplexed together then the corresponding modulating Markov process, 
denoted with abuse of notation as Z(t) as well, has the states {0, 1, ... , n} and the transition rates as depicted in Figure [3] 
The cumulative arrival process for the aggregate flow is defined identically as for each sub-flow, i.e., 

A(s,t) := [ Z{u)Pdu V0 < s < t , A(t) := A(0,t) . 



Note that, by definition, A(s,t) is continuous. 
A. Martingale Bounds 

Recall our main goal of deriving bounds on the distribution of the through aggregate's delay process Wi(t) for the FIFO, SP, 
EDF, and GPS scheduling scenarios in the network model from Figure Q] We start this section with a general result enabling 
the analysis of all four scheduling scenarios, and then analyze each separately. 

Theorem 1: (MARTINGALE Sample-Path Bound) Consider the single-node queueing scenario from Figure Q] in which 
n sub-flows are statistically independent MMOO processes with transition rates /i and A, and peak rate P, and starting in the 
steady-state. The aggregate arrival processes are Ai(t) and A 2 {t), each being modulated by the (stationary) Markov processes 
Zi(t) and Z2{t) with n\ and n 2 states, respectively, with n\ + n 2 = n. Assume that the utilization factor p := 2— satisfies 
p < 1 for stability, where p is the steady-state 'On' probability; assume also that P > c to avoid a trivial scenario with zero 
delay. Then the following sample-path bound holds for all < u < t and a 

sup {A 1 {s,t-u)+A 2 {s,t) -C(t-s)} > a 

0<s<t-u 

< /v"e- 7(ClU+<T) , (7) 



where C\ = n\c, K = p ( ) P , and 7 = 



F-c 



The theorem enables the per-flow analysis for several common scheduling algorithms: FIFO, SP, EDF, and GPS (see 
Subsections IIV-AlTlIV-A4l ). In this way, the theorem considerably generalizes a result by Palmowski and Rolski ll37l . restricted 
to an aggregate analysis under FIFO. The key to our proof is the construction of a single martingale M t , from two existing 
martingales ll37l . such that the per-flow analysis for the different scheduling algorithms becomes possibleQ. The sample-path 
bound from Eq. (0 then follows from a standard technique based on the Optional Sampling theorem, applied to the martingale 
M t ; for relevant definitions and results related to martingales we refer to the Appendix. 

Proof: Fix u > and a. For convenience, let us bound the probability from Eq. ([7) by shifting the time origin and using 
the time-reversed representation of arrival processes described in Section UTH i.e., 



sup{A r 1 (u,t)+A 1 2 {u,t)-C(t-u)} + A I 2 (u)-C 2 u>C 1 u + a) , (8) 

t>u J 

where C2 = n^c. This representation is possible because the underlying Markov modulating processes of A\(t) and ^(i), i.e., 
Z\(t) and Z 2 {t), respectively, are time-reversible processes (see, e.g., Mandjes ||33l . p. 57); the reversibility is a consequence 
of the fact that Z\(t) and Z 2 it) are stationary birth-death processes (see Kelly lF24l . pp. 10-11). Denote by Z\(t) and Z r 2 (t) 
the time-reversed versions of Zi(t) and Z 2 (t), respectively. 

Given the previous probability event we define the stopping time 

T := inf {t > u : A[(u, t) + A\{u, t) - C(t - u) + A T 2 (u) - C 2 u > Cm + a} . (9) 

This construction is motivated by the fact that P(T < 00) is exactly the probability from Eq. (H). The goal of the rest of the 
proof is to bound P (T < 00). 

Let Pj ; j denote the underlying probability measure conditioned on Z\(u) = i and Z 2 (0) = j, for < i < n\ and < j < n 2 . 
Next we define the following two processes 

Mx(t) := e-^M-'V/^M-co* V< > u and 
M 2 {t) := e -»(^(*)-J') e 7/ t (P^W-C 2 )d s yt > ) 



A similar development is possible using other martingale constructions by Duffield [18] or Chang |7], pp. 339-343. 
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where 9 := log j^-f^- Note that 9 < due to the stability condition p < 1. 

According to Palmowski and Rolski (37), both M\(t) and M 2 (t) are martingales with respect to (wit) Pj j and the natural 
filtration (for the original result see Ethier and Kurtz |fl9l , p. 175). Moreover, according to Lemmas [2] and Q] from the Appendix, 
the following process 

' M 2 (t) , t<u 

M 1 (t)M 2 (t) , t > u 

is also a martingale (note that Mi (it) = 1, by construction). 

Because T may be unbounded, we need to construct the bounded stopping times T A k for all k € N. For these times, the 
Optional Sampling theorem (see Theorem [3] in the Appendix) yields 



E M [A/ ] = E,, m 



TAfcJ 



for all k G N, where the expectations are taken wit Pjj. Using Eij [Mq] = 1 and according to the construction of M 2 (t) we 
further obtain for k > u 



1 > E iA [M TAk h 
-C1+C2 

■,3 



> e -e{^-{i+o)) e i(Cu+a)^. (T < k) ; 

where denotes the indicator function. The first term in the product follows from 9 < and 

(Z[(T) + Z 2 (T)) P > d + C 2 , 

according to the construction of T from Eq. (O and the continuity property of the arrival processes. The second term follows 
from 7 > and the construction of T. 

By deconditioning on i and j (note that Z[ (it) and Z l 2 (0) are in steady-state by construction) we obtain 



>-j 



Using the identities 

£ e *(*-) = and £ e e (^) - if" 2 
i=i i=i 

(see f37l ) and taking fc->oowe finally obtain that 

P(T < 00) < A""e- 7(Cltl+CT) , 

which completes the proof. □ 

In the following we fix < d < t and derive bounds on P (Wi (i) > d) for FIFO, SP, EDF, and GPS scheduling; the 
derivations follow more or less directly by instantiating the parameters of Theorem Q] for each scheduling case. 

1) FIFO: The FIFO server schedules the data units of Ai(t) and A 2 (t) in the order of their arrival times. 

To derive a bound on the distribution of the through aggregate's (virtual) delay process W\(t), we rely on a service process 
construction for FIFO scheduling, as mentioned in Section [HI] We use the service process from Cruz |[T6l extended to bivariate 
stochastic processes, i.e., 

Si(*,t) = [C(t-s)-A 2 {s,t-x)] + I {t _ s>x} , (10) 

for some fixed x > and independent of s and t (for a proof, in the sightly simpler case of univariate processes, see Le Boudec 
and Thiran {3), pp. 177-178; for a more recent and general proof see Liebeherr et al. Il30ll ). By notation, [y] + := max{y,0} 
for some real number y. 

Using the equivalence of events 

Wi[t) >d^A 1 {t-d) >D 1 {t) , 

and also the service process representation from Eq. (0, we can bound the distribution of W\ (t) as follows 

P(Wi(i) > d) 

< P (ill (i-d) >A 1 *S 1 (t)) 

= P( sup {A 1 (s,t-d)-[C(t-s)-A 2 (s,t-x)} + / {t _ s>x} } > 0) . (11) 

V 0<s<t-d > 

Here we restricted the range of s from [0,t] to [0,t — d), using the positivity of the "[•]+' operator and the monotonicity of 
Ai{s,t). 
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Because x is a free parameter in the FIFO service process construction from Eq. ([Tol l, let us choose x = d. With this choice 
it follows from above that 

P(Wi(t) > d) 

<P( sup {Ai{s,t-d) + A 2 {s,t-d) -C(t-s)} > 

\0<s<t-<i 

By applying Theorem Q] with u = and cr = Cd, we get the following 
Martingale Delay Bound (FIFO): 

W>(Wi(t) > d) < K n e^ cd , (12) 

where K and 7 are given in Theorem Q] Note that the bound is invariant to the number of sub-flows m, which is a property 
characteristic to a virtual delay process (for FIFO); such a dependence will be established by changing the measure from a 
virtual delay process to a packet delay process (see Section IIV-Eb . 

2) SP: Here we consider an SP server giving higher priority to the data units of the cross flow A 2 (t), We are further 
interested in the delay distribution of the lower priority flow; the case of the higher priority flow is a consequence of the 
previous FIFO result. 

We follow the same procedure of first encoding Ai(t)'s service view in a service process, e.g., (see Fidler lEOl ). 

S 1 {s,t)=C{t-s)-A 2 {s,t) . (13) 

now in the case of SP scheduling. 

To bound the distribution of W\ (t) we continue the first two lines of Eq. ( fTTT i as follows 

P(Wi(t) > d) 

<P( sup {A 1 (s,t-d)+A 2 (s,t)-C(t-s)}>0 

\0<s<t-d 

By applying Theorem Q] with u = d and a = 0, we get the following 
Martingale Delay Bound (SP): 

v(Wi(t) >d\ < K n e~ lCld , (14) 

where K and 7 are given in Theorem Q] Note that, as expected, the SP delay bound recovers the FIFO delay bound from 
Eq. ( fT2l when there is no cross aggregate, i.e., in the case when C\ = C. 

3) EDF: An EDF server associates the relative deadlines d\ and d^ with the data units of Ai(t) and A 2 {t), respectively. 
Furthermore, all data units are served in the order of their remaining deadlines, even when they are negative (we do not 
consider data unit losses). 

A service process for A\ (t) is for some x > 

Si(s,t) = [C(t-s) - A 2 (s,t - x + mm{x,y})] + I {t _ s>x} , (15) 

where y = d^ — d\ (see Liebeherr et al. l30l ). This service process generalizes the FIFO one from Eq. ( fTOb (which holds for 
y = 0, i.e., the associated deadlines to the flows are equal), and it also generalizes a previous EDF service process by Li et 
al. Il28l (which is restricted to x = 0). 

To derive a bound on P (W\(t) > d), for some d > 0, let us first choose x :~ d, as we did for FIFO. Next we distinguish 
two cases depending on the sign of y. 

If y > then the continuation of Eq. ( fTTT ) is 

P(Wi(t) > d) 



< 



( sup {Ai(s,t - d) + A 2 (s,t - d + min{d, y}) - C(t - s) I > 0) 



By changing the variable t t + d — min{d, y} we get 

P(Wi(t) > d) 

<P( sup {Ai(s,t-mm{d,y}) + A 2 (s,t)-C(t-s + d-mhi{d,y})}>o) . 

\a<s<t-min{d,y} J 

(we point out that as we are looking for the steady-state distribution of W\ (t), we can omit the technical details of writing 
Wi(t + d — min{(f, y}) above.) We can now apply Theorem Q] with u = min{d, y} (note that both d and y are positive) and 
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a = C(d — min{d, y}), and get the following 
Martingale Delay Bound (EDF) {d\ > d* Case): 

p(Wi(t) > d\ < K n e~ rC2min{d i- d i' d} e-~ fCd , (16) 

where K and 7 are given in Theorem Q] 

The second case, i.e., y < 0, is slightly more complicated. The reason is that mm{d,y} = y (see Eq. ( fT31 l) such that the 
continuation of Eq. (TTTI i becomes 

sup {^(s.t-d)- [C(*-s)- J 4 2 (s,t-d + j/)] + / { t_ s>d} } >0) . (17) 

0<s<t-d / 

Note that when sE [4 — d+ y, t — d), then one must consider A 2 (s, t — d + y) := according to the conventions from 11301 . 
Therefore, one must perform the splitting [0, t — d) = [0, t — d + y) U [t — d + y, t — d); thereafter, by changing the variable 
t <— t + d, the continuation of Eq. ( fTTI i is 

<p(maxi sup {A 2 (s,t + y)+Ai(s,t)-C(t-s)}, sup {Ax(s, t) - C(t - s)} \ > Cd 



<P sup {A 2 (s,t + y) + A 1 (s,t)-C(t-s)}>Cd +P sup {A x (s,i) - C(t - s)} > Cd 

\0<s<t+y J \0<s<t 

In the third line we applied the Union Bound [sic], which is conceivably tight because the two elements in the 'max' are 
rather uncorrelated. Moreover, we extended the left margin in the last supremum (in the fourth line), as we are looking for 
upper bounds, whereas the martingale argument from Theorem Q] is insensitive to where the left margin starts. 

The last two probabilities can be directly evaluated with Theorem Q] For the first one we set u = —y (note that y is now 
negative) and a = Cd. For the second one we set u = 0, n-2 = 0, a = Cd, and we properly rescale the per-flow capacity c 
and utilization factor p (see below). In this way get the following 
Martingale Delay Bound (EDF) (d* < d? 2 Case): 



?(Wi{t) > d) < K n e lC ^ d *- d *~ ) e-~ <Cd + K' n e- Ycd 



(18) 



with the same K and 7 from Theorem Q] whereas K' and 7' are obtained alike K and 7, but after rescaling d ^— " 1+ " 2 c and 
d = - J ^—p- 

Note that the first EDF bound from Eq. ([TBI recovers the FIFO bound when the associated deadlines are equal, i.e., when 
d* = d 2 - In turn, the second EDF bound from Eq. (fTSI l would also recover the FIFO bound, but only by dispensing with the 
unnecessary splitting of the interval [0, t — d) since y = 0. 

4) GPS: We consider a GPS server (see Parekh and Gallager ll38l . or Demers et al. ifTTl ) which assigns positive weights <fii 
and (j>2, normalized here such that 4>\+ 4>2 = 1, to the flows A\{t) and A 2 (t), respectively. Denoting by D\(s, t) and I?2(s, t) 
the corresponding departure processes, GPS guarantees that for every continuous backlogged flow i in a time interval [s,t), 
the following holds 

Di(s,t) & 

777— 7T >-forj^. (19) 

One service process for the flow A\{t) is (see Chang Q, p. 68) 

Sx{s,t) = cj )1 C{t-s) , (20) 

which corresponds to the minimum service guarantee by the GPS property from Eq. ([T9V Unfortunately, this service process 
does not capture the full server capacity allocated to the flow A\(t) when the other flow is not backlogged, and it thus 
conceivably leads to loose bounds. 

An improved service process was constructed by Li et al. l28l . but it requires an additional concavity assumption on the 
flow A 2 (t); therefore, it does not apply in our setting. An improvement in the general case (i.e., for any types of arrivals) can 
be obtained when the SP service process from Eq. (fT3~t is larger than the one from Eq. (l20t : note that the SP service process 
holds by default. The improvement can be substantial for small values of <f>\. Indeed, in the extreme case when <f>\ = 0, the 
GPS service process from Eq. (l20l would predict infinite delays as the system would be (wrongly) perceived in overload; in 
turn, by relying on the SP service process, finite delays can be obtained. 

Nevertheless, despite the pessimistic outlook of relying on the service process from Eq. (l20l . we will compute the Martingale 
and Standard bounds on the distribution of Wi(t). The reason is to illustrate, by means of comparisons with simulations, that 
the service process from Eq. (l20t lends itself to loose bounds, even in 'favorable' rate proportional processor sharing (RPPS) 
scenarios such as n\ = ri2 and <j>\ = <$> 2 (see Section IIV-Eb . 
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To derive the Martingale delay bound, we continue the first two lines of Eq. ( fTTb as follows 

P(Wi(t) > d) 

<P[ sup {A 1 (s,t-d)~(f> 1 C(t-s)}>0 

\0<s<t-d 

Next, by applying Theorem Q] with C := <j)\C, u := d, a := <j)\Cd, and n-i := 0, in this order, we obtain the following 
Martingale Delay Bound (GPS): 

v(Wi(t) > d) < if» e -T^iCd ; (2 1) 
where K and 7 are given in Theorem [1] note that the new utilization factor is p := n j^Q ■ 
B. Standard Bounds 

Here we briefly review the standard (per-fiow) delay bounds obtained with SNC for FIFO, SP, EDF, and GPS scheduling 
algorithms. These bounds will be compared, both analytically and numerically, against the Martingale bounds computed so far. 

We assume that for each MMOO sub-flow A^(t) the corresponding Markov process Z{t) (with two states, from Figure |2]l 
starts in the steady-state, i.e., 

P (Z(0) = 0) = 1 - p and P(Z(0) = 1) = p , 

where p was defined earlier, i.e., p = jq^j- The computation of the Standard delay bounds relies on the moment generating 
function (MGF) of Ao(t), which can be written for all t > and some 9 > as (see Courcoubetis and Weber iTPTl ) 



E 



e 8A Q (t) 



I Sr' t 1 9rnt 

we e + we 6 



where w> = j^feQ , w = ^gg^ , rj = r e = 6 = A + M - 6P, and A = ftf + 4^P. Since 

iy' + w = 1 and r' g < rg, it follows that 

^ < e 0ret , (22) 



e eA (t) 



which is the typical approximation of the MGF in the SNC literature by a single (dominant) exponential. The rate rg corresponds 
to the effective bandwidth; it is non-decreasing in 9 and satisfies 

pP < rg < P , 

i.e., rg is between the average and the peak rate of a single MMOO process. 

The next result gives a common sample-path bound which will be used to compute the Standard delay bounds for all three 
scheduling algorithms; the result parallels the one from Theorem [T] which was used for the Martingale bounds. 

Theorem 2: (STANDARD S AMPLE-PATH Bound) Consider the same hypothesis as in Theorem [T] and in addition assume 
that the modulating processes of the sub-flows start in the steady-state. Then the following sample-path bound holds for all 
< u < t and a 

sup {Ax{s,t - u) + A 2 (s,t) - C'(t - s)}> a 

0<s<t-u 

< inf Le -f>(C-n 2 r )u e -6a ^ (23) 
{9:c>r e } 



where L = and rg was defined prior to Eq. 

The proof proceeds by first discretizing the sample-path event and then by using standard arguments in SNC and effective 
bandwidth theory based on the Union and Chernoff bounds (i.e., the second and third bounding steps discussed in Section HIT). 
Similar proofs have been given for various sample-path events (see, e.g., ifTTl ). 

Proof: Fix < u < t, a, and 9 > such that c > rg. Consider the free parameter tq > for discretizing the 
event from Eq. (f23) at the points j = [ *~"~ s J for all < s < t. Using the monotonicity of the arrival processes (e.g., 
Ai(s, t — u) < Ai{t — u — (j + 1)t , t — u)), we can bound the probability event from Eq. ((23) by 

|J {Ai(t - u - (j + 1)to, t - u) + A 2 (t - u - (j + 1)to, t)-C(t-(t-u- jro)) > a} 

<^2f(Ai(t-u-jT ,t-u) + A 2 (t-u-jTo,t)-C(u + jT ) > -Ct + a) 

< e -6(C-nre)jTo e -6(C-n2re)u e eCTo e -8o- 

i>i 

p SCt 

< e e -e(C-n 2 r e )u e -9(T 

~ 9(C - nrg)T 
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The derivations relied first on the Union Bound, then on the Chernoff bound applied to the MGF bound from Eq. (l22t . and 
finally on the inequality X^>i e_a ' 7 — a f° r some a > 0. Since r is a free parameter, the last bound can be optimized with 
T ° = Jc- Fi na lly-. taking the minimum over 9 completes the proof. □ 

Next we list the standard bounds on Ai(t)'s virtual delay for FIFO, SP, EDF, and GPS scheduling. They are obtained alike 
the Martingale ones from Eqs. (fTZt . (fl4t . and (|2TT >. except that the sample-path bound from Theorem Q] is replaced by the one 
from Theorem [2] 



FIFO : 


■ ■ ■ < 




inf Le~ ecd 




(24) 






{< 


■ c>rg} 




SP : 


■ • ■ < 




inf L e -»(C-» 2 r e )d 




(25) 






{t 


■ Org} 




EDF 1 : 


■ • ■ < 




inf Le en2r " min { d i _ 


d£,d} -$Cd 


(26) 






{i 


■ c>r e } 




EDF 2 : 


• • < 




inf L( J>(c-n ire )(di 


-d* 2 ) e -ecd + inf L , e -ecd 


(27) 






{i 


■ Org} 


{9:C>rg} 


GPS : 


• • • < 




inf Le- 9 ^ Cd 




(28) 






{< 


:<j>iC>mre} 





where L is given in Theorem [2] for FIFO and SP, and L = x^i^r for GPS. We mention that EDF 1 corresponds to the case 
d* > d\ ( see tne Martingale bound from Eq. ([Toll), and EDF 2 to the complementary case (see Eq. (fT8l). Moreover, for EDF 2 , 
d is the rescaled value of c (as in the Martingale bound from Eq. ([T8~t). whereas L' is defined like L but with c replaced by 
d. We also point out that the GPS bound is roughly the same as the one computed by Zhang et al. 11451 . for more general 
arrival classes and without using the service process concept (which was still to be properly formalized by Cruz lfj"5l a year 
after). 

C. Asymptotic Decay Rates Comparison 

Here we compare the asymptotic decay rates (denoted by 77) of the Martingale and Standard bounds on P(Wi(£) > d), 
which can be isolated by taking the limit 

logP(Wi(t)>d) 
T] := — lim . 

d— >oc d 
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TABLE I 

Asymptotic decay rates forP (Wi(t) > d); for FIFO, EDF, and GPS: 7 = 8* 

Table U lists the asymptotic decay rates for FIFO, SP, EDF, and GPS scheduling. For the Standard bounds, the 6*'s are the 
solutions to the optimization problems from Eqs. (I24l -(l28l. respectively. Next we discuss the bounds by grouping them into 
two groups. 

1) FIFO, EDF, and GPS: We only focus on FIFO; the other two are analogous. 
For the Standard FIFO bound, 9* is the unique solution of the equation 

re = c, (29) 

where re is the effective bandwidth from Eq. (l22l . This is a standard result in the effective bandwidth literature (see, e.g., 
Glynn and Whitt ED or Chang Q, p. 291). 

Next we show that 6* = 7, i.e., the asymptotic decay rates are equal. To this end, we first recall from ifPJI that rg is derived 
by solving the eigenvalue problem 

Q fl x = Celx (30) 

where x stands for the eigenvector and 

Qe = ( A* -X + P6 )' I:= ( 1 ) ' 
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for some 8 > 0. If (g denotes the spectral radius of Qg (corresponding to u>2 in |[T3l , Section 3), then the effective bandwidth 
is defined as r$ = % (see Eq. d22l)). 

In turn, 7 from the Martingale bound satisfies the generalized eigenvalue problem (see J37)) 

Q y = 7Dy, (31) 

where y stands for the eigenvector and 

D: =(o ° 

V c — p 

Using the positivity of 7 (see Theorem [TJ and applying the Separation Lemma from Sonneveld ll43l . it follows that 7 is the 
spectral radius. Furthermore, by relating the eigenvalue problems from Eqs. (l30t and (|3H through the equality 

Q 7 - C7I = Q - 7D , 



it follows that cj = £ 7 . Finally, since 8* is the unique solution of Eq. (I29l l. whereas rg* = by definition, it follows that 
8* = 7. This result is particularly important as it illustrates an explicit solution for the effective bandwidth equation from 
Eq. @5). 

2) SP: Unlike in the FIFO case, there is no immediate explicit solution for the optimal value 8* . Because c = r 7 , as shown 
above, we can observe that 

7C1 = 7 (C - n 2 r 7 ) , 

which indicates a similarity in the two asymptotic decay rates. Unfortunately, due the form of the optimization problem from 
Eq. (l25l l. we will solely resort to numerical comparisons between the Martingale and Standard bounds. 

In conclusion, in large buffer (or, here, delay) asymptotic regimes, FIFO, EDF, and GPS exhibit the same tail behavior for 
both the Standard and Martingale bounds; according to numerical results, the same holds for SP (see Section IIV-EI . Next we 
consider many-sources asymptotics, which are also commonly discussed in the literature (e.g., Mazumdar 1134) ). 



D. Many-Sources Asymptotics Comparison 

We consider the following scaling scheme: the total number of flows n is scaled up, whereas the rest of the parameters, i.e., 
the utilization factor p, the per-flow rate ro = pP, and the per-flow capacity c remain unchanged. 

Let us first observe that the factors K (defined in Theorem [TJ and L (defined in Theorem |2]l from the two sets of bounds 
satisfy 

K < 1 and L > 1 . 

The second property is immediate. In turn, for the factor K, note that < 1 and the functions f(x) := f5f an d 9( x ) : = 
are non-increasing on x G (0, p). Thus, by the composition of the power function (defined on the interval (0,1)) and the 

x — p 

exponential function, the function h{x) := I y^f ) P is non-increasing. Moreover, lim^o h{x) = p^ 1 , and thus K < 1. 
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TABLE II 

Scaling laws for the Martingale and Standard bounds onP(Wi(£) > d) in the total number of flows n = n\ + n 2 ; a and rj are 

INVARIANT TO n 



Table |TT] illustrates the scaling laws of the Martingale and Standard delay bounds for the four scheduling algorithms. The 
factors a > and r\ > are invariant to n and can be fitted for each individual case; e.g., in the case of FIFO, a = — log K and 
77 = 7c. We remark that all pairs of bounds have the same asymptotic decay rate 77. The critical observation is that, unlike the 
Standards bounds, the Martingale bounds have an additional factor e~ an decaying exponentially with n. This scaling behavior 
was indicated by Choudhury et al. J9] by numerical evaluations. We point out that (9] further indicated an additional factor 
j3 > 0, invariant to n, which is however not captured by the Martingale bounds. 

E. Numerical Comparisons 

In this section we compare the Martingale and Standard bounds against simulations and also in asymptotic regimes. The 
parameters of a single MMOO source are A = 0.5, p — 0.1, and P = 1 (the average 'Off period is five fold the average 'On' 
period). 
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1) Bounds vs. Simulations: We consider two utilization levels (p = 0.75 and p = 0.9), and two degrees of multiplexing 
(ni = n,2 = 5 and ni = n 2 = 10). The packet sizes in a packet-level simulator are set to 1; fractional packet sizes are 
additionally set when the dwell times in the states of the Markov process from Figure [3] are not integers. The simulator 
measures the delays of the through flow's first 10 7 packets, and it discards the first 10 6 . For numerical confidence, 100 
independent simulations are being run and the results are presented as box-plots. 

For the soundness of the comparisons against simulations, it is important to remark that the delay analysis so far concerned 
the virtual delay process W\(t), which corresponds to the delay of a through flow's infinitesimal unit, should it depart, or 
equivalently arrive, at time t; more concretely, we note that the bounds computed with SNC on virtual delays are identical, 
should they concern a virtual arrival or departure unit. In the packet level simulator, however, it is the packet delay process which 
is being measured, and which is denoted here by W\ (n) (the index 'n' corresponds to the packet number for the through flow). 
Therefore, one has to properly perform a suitable change of probability measures in order to provide meaningful numerical 
comparisons. 

We follow a Palm calculus argument and relate the measure of the virtual delay process to that of the packet delay process 
(see Shakkottai and Srikant fiTl ). For convenience, we work in reversed time and focus on time where steady-state is assumed 
to be reached. Denoting W\ := Wi(0), we can write by conditioning 

¥(Wi>d) = ¥{Wi > d | ai(0) > 0)P(ai(0) > 0)+P(Wi > d | oi(0) = 0)P(ai(0) = 0) 
> P(Wi>d|oi(0)>0)P(oi(0)>0) 

= P (Wi > d)p(oi(0) > 0) , (32) 

where ai(0) denotes the instantaneous arrivals of the through flow at time 0, and W\ denotes the steady-state packet delay 
process of the through flow. Note that for the inequality we eliminated the second term in the sum above. 
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Therefore, 



< 



' (Wi > d) 



(33) 



l-(l-p)* 

(Recall that p is the steady-state 'On' probability of the MMOO process from Figure [2]) 

Below we compare the distribution of W\ against the one of the measured (simulated) delay process. All the Martingale and 
Standard bounds which we compute for FIFO, SP, EDF, and GPS scheduling are scaled up by the additional prefactor from 
Eq. d33l needed for the change of measure. We note that this scaling is conservative because of the inequality from Eq. (|32| ). 

Figure |4] illustrates the comparisons for FIFO scheduling (recall Eqs. (fT2l) and d24l) for the Martingale and Standard delay 
bounds, which are scaled as in Eq. d33}): the y-axis uses a log scale. The irregular tail behavior (including the presence 
of many outlierJTIat p = 90%) of the box-plots is due to the restriction of the simulation runs to 10 7 packets. All four 
scenarios, corresponding to various utilizations and multiplexing, clearly indicate that the Standard bounds are very loose, as 
they overestimate the simulation results by a factor of roughly 10 2 at 75% utilization (see (a) and (c)), and even 10 3 at 90% 
utilization (see (b) and (d)). In turn, the Martingale bounds are reasonably accurate. We suspect that the slight loss of accuracy 
for ri\ = n 2 = 5 (in (a) and (b)) stems from the conservative change of measure from Eq. ( 132k indeed, at n\ = n 2 = 10 
(in (c) and (d)) one can notice an increase in accuracy due to the lesser role played by the change of measure prefactor from 
Eq. 03}. 
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The same observations hold for SP scheduling, as indicated by Figure |5J recall the Martingale and Standard delay bounds 
from Eq. dT4t and d25} . respectively, which are again scaled as in Eq. d33t . Moreover, note that the SP delays increase roughly 

2 Outliers are depicted in the box-plots with the '+' symbol; on each box, the central mark is the median, and the edges of the box are the 25th and 75th 
percentiles. 

3 The long stretch of the box-plots and the presence of many outliers is caused by the choice of 10 7 arrivals, in order to illustrate the need for very long 
simulation runs (e.g., 10 8 arrivals, in which case the box-plots would significantly shrink and most outliers would disappear). 
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by a factor of 2 relative to the FIFO delays, due to the setting n\ = ri2\ the same factor predominates at higher multiplexing 
regimes as well. While this is an indication that scheduling can matter, we refer to Section HV-E3I for a specific asymptotic 
scenario (i.e., with many flows) in which scheduling does not matter. 
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Fig. 6. EDF delay bounds (rti = «2 = 10, p = 75% in (a) and (c), and p = 90% in (b) and (d)) 



The tightness of the Martingale bounds, in contrast to the looseness of Standard bounds, further holds in the case of EDF 
scheduling, for both cases (i.e., d\ > and d* < d%), as illustrated in Figure [6] recall the Martingale and Standard delay 
bounds from Eqs. ([T6]i-(fT8]i and Eqs. d26t-<f27T>. respectively. Note that the bending of the curves, e.g., in (a), is due to the 
choice of d* and d^: the bounds behave like the SP bounds for d < d\ — d%, and asymptotically like the FIFO bounds thereafter. 

The previous numerical illustrations conceivably indicate the tightness of the 'proprietary' SNC bounding step from Eq. (fl3, 
in the case of FIFO, SP, and EDF scheduling. More concretely, the state-of-the-art service processes for these scheduling 
algorithms (see Eqs. ( [Tol l. ( fT3l l. and ( fT5l l) appear to be tight, as long as they are used in conjunction with the 'right' tools from 
probability theory. The same illustrations also confirm the pitfall identified in Section [Til concerning the potential looseness of 
the Union Bound in the case of bursty processes. 

The positive side of our exposition so far (i.e., the Martingale bounds fix the very loose Standard bounds) is disturbed, 
however, in the case of GPS scheduling (with 4>i ~ 4>2 — -5). Indeed, let us refer to Figure [7] showing the corresponding 
numerical comparisons (recall the Martingale and Standard bounds from Eqs. (ETT i and (l28ll): we mention that the packet-level 
simulator implements WFQ scheduling, i.e., the packetized version of GPS (see Keshav [2611 . pp. 238-242). Unlike in the FIFO, 
SP, and EDF cases, the Martingale bounds are now very loose. This indicates that, in addition to the Union Bound bounding 
step, the 'proprietary' SNC bounding step from Eq. ((T) can also be very loose. In other words, the service process for GPS 
scheduling from Eq. d20b needs to be significantly improved. 

2) Asymptotic comparisons: Here we illustrate the bounds in a many sources asymptotic regime: we fix the target delay bound 
d = 5 and plot the per-flow violation probabilities P (W\ > d^j by increasing the total number of flows n. For visualization 
purposes we restrict to FIFO, SP, and one case of EDF. 
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The important observation from Figure [8]is that, for all settings (e.g., FIFO, p = 75%), the Standard and Martingale bounds 
diverge. In other words the discrepancy between the Standard and Martingale bounds grows arbitrarily large (e.g., by a factor 
of 10 9 at n = 10 3 ). This observation supports the scaling laws from Table [U] In the light of this rather pessimistic evidence 
concerning the Standard bounds, the immediate question is whether they can still have any practical relevance; next we clarify 
this concern. 

3) The Standard bounds can (sometimes) be useful: Here we show that, despite the poor scaling in multiplexing regimes, 
the Standard bounds can be practically relevant. To this end, we consider the following connection admission control problem: 
given the server capacity C, the target (per-flow) delay bound d, and a violation probability e, determine the maximum number 
of flows n to meet all the three constraints. 

Figure [9] shows the achievable utilization (i.e., p = ^f-) as a function of the capacity C for FIFO and SP, two delay targets 
d = 1 and d = 10, and two violation probabilities e = 10~ 3 and 10 -9 . The important observation is that once d and e are both 
fixed, then by increasing the capacity C (i.e., 'making room for sufficient statistical multiplexing to kick in') the Martingale 
and Standard bounds converge to one, albeit at different rates. 

The convergence for both FIFO and SP is a manifestation of the assertion from the literature that "scheduling has only a 
limited impact" (e.g., on admission control) (see, e.g., Li et al. l28l ); such an assertion, however, does not generally hold (see, 
e.g., Figures @] [5] and [SJ. 

We conclude that the Standard bounds can (sometimes) be nearly optimal and thus be practically relevant. This remark is 
typically presented in the literature in the form: the admissible region based on the Standard bounds converges to the admissible 
region based on averages (see, e.g., Boorstyn et al. J3 or Li et al. l28l ). Variations of the underlying scaling regime, enabling 
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such conclusions, are often adopted in the literature in order to expose the Standard bounds in a favorable light. 

V. Conclusions 

In this paper, we have put our finger in a wound of the stochastic network calculus: the lingering issue of the tightness of 
the SNC bounds. To some degree, this issue had been evaded by the SNC literature for some time although it is a, if not the, 
crucial one. In fact, we demonstrated that the typical (Standard SNC) way of calculating performance bounds results in loose 
delay bounds for several scheduling disciplines (FIFO, SP, EDF, and GPS) as well as for various multiplexing regimes. This 
becomes particularly obvious when comparing the (Standard SNC) analytical results to simulation results, where discrepancies 
up to many orders of magnitude can be observed. So, we strongly confirm the often rumored conjecture about SNC's looseness. 

Yet, the paper does not stop at these bad news, but in an attempt to understand the problems of Standard SNC, which mainly 
lie in not properly accounting for the correlation structure of the arrival processes (by coarse usage of the Union bound), we 
find a new way to calculate performance bounds using the SNC framework based on martingale techniques. Here, SNC still 
serves as the "master method", yet the Union bound is substituted by the usage of martingale inequalities, to make a long 
story short. Comparing the new Martingale SNC bounds to the simulation results shows that they are remarkably close in 
most cases, which rehabilitates the SNC as a general framework for performance analysis. So, the SNC can arguably still be 
regarded as a valuable methodology with the caveat that it has to be used with the right probabilistic techniques in order not 
to arrive at practically irrelevant results. As usual, there is a "but" to such a general statement: this is the issue about GPS for 
which even the Martingale bounds stay loose (though again improving by orders of magnitude over the Standard bounds). This 
may be indeed a case where SNC as master method fails, because it might be wrong to separate the service process derivation 
from the arrivals of the flow under investigation. On the other hand, this is very speculative and it may still be possible to find 
a service process that tightly captures the GPS characteristics. We leave this interesting open issue for future work. 

Other related challenges include extensions to general Markov arrival processes (e.g., by using the general result from l37l ). 
self-similar arrival processes, and the multi-node case (e.g., by accounting for the martingale representation from fl6)). 
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Appendix 

Here we summarize some definitions and results related to martingales, which are needed in the paper. 

Consider a (continuous-time) stochastic process (Xt)t>o defined on some joint probability space (f2, J 7 , P). A filtration is 
a family {Tt ■ t > 0} of sub-cr-fields of T such that T s C Tt for all < s < t. We are particularly interested in the natural 
filtration generated by X t , i.e., T^ := a (X s : s < t). 

Definition 1: (STOPPING Time) A stopping time, wrt a filtration Tt, is a non-negative random variable T such that 

{T < t} e T t Vi > . 
In this paper we are particularly interested in the first passage time 

T = inf {X t > x} , 

t>Q 

for some non-negative x. To avoid technical considerations related to conditions under which T is a stopping time, we assume 
throughout that the sample-paths X t (u) are right-continuous Vcj G il; this assumption is implicitly fulfilled by the definition 
of the arrival process from Eq. (|6). 

Definition 2: (MARTINGALE) A continuous time process X t is a martingale wrt the natural filtration Tf- if 

1) E [\ X t |] < oo Vi > and 

2) E [X t | T s ] = X s V0 < s < t . 

The next three results are needed in the proof of the main theorem in the paper. 

Lemma 1: (OPTIONAL SWITCHING) Consider that X t and Y t are martingales wrt T := Tf' Y , and assume that X u = Y u 
for some u > 0. Then the process 

X t , if i < u 



Y t , if i > u 
is a martingale wrt T. 

For the corresponding result in discrete time see Grimmett and Stirzaker 11221 . p. 488. 
Proof: According to the martingale definition, the non-trivial property to prove is 

E [Z t | J- s ] = Z s , 

for t > u and s < u. Indeed, we have according to the tower property of conditional expectation 

E [Z t | F s ] = E[E [Z t | F u ] | F s ] 

= E[Y U \ j;] = E [X u | F s ] 
= Z s , 

which completes the proof. □ 
Lemma 2: (Product of Independent Martingales)!!) Consider that X t and Y t are independent martingales wrt 

T := T t ' ■ Then X t Y t is a martingale wrt T . 

Theorem 3: (OPTIONAL SAMPLING THEOREM)(see £19], p. 61) If X t is a right-continuous martingale and T is a finite 

stopping time wrt , then 

E [X T ] = E [X ] . 



